9 <120> TITLE OF INVENTION: Oligomerization of Hepatitis Delta Antigen
12 <130> FILE REFERENCE: 0725.1056-001
13 <140> CURRENT APPLICATION NUMBER: 09/347,175
15 <141> CURRENT FILING DATE: 1999-07-01
17 <150> PRIOR APPLICATION NUMBER: 60/091,609
18 <151> PRIOR FILING DATE: 1998-07-02
20 <160> NUMBER OF SEQ ID NOS: 35
22 <170> SOFTWARE: FastSEQ for Windows Version 4.0
24 <210> SEQ ID NO: 1
25 <211> LENGTH: 49
26 <212> TYPE: PRT
27 <213> ORGANISM: Hepatitis Delta Virus
29 <400> SEQUENCE: 1
30 Gly Arg Glu Asp Ile Leu Glu Gln Trp Val Ser Gly Arg Lys Lys Leu
31 1 5 10 15
32 Glu Glu Leu Glu Arg Asp Leu Arg Lys Leu Lys Lys Lys Ile Lys Lys
33 20 25 30
34 Leu Glu Glu Asp Asn Pro Trp Leu Gly Asn Ile Lys Gly Ile Ile Gly
35 35 40 45
36 Lys

40 <210> SEQ ID NO: 2
41 <211> LENGTH: 49
42 <212> TYPE: PRT
43 <213> ORGANISM: Hepatitis Delta Virus
45 <400> SEQUENCE: 2
46 Gly Arg Glu Glu Val Leu Glu Gln Trp Val Asn Ser Arg Lys Lys Ala
47 1 5 10 15
48 Glu Glu Leu Glu Arg Asp Leu Arg Lys Thr Lys Lys Lys Ile Lys Lys
49 20 25 30
50 Leu Glu Asp Asp Asn Pro Trp Leu Gly Asn Ile Lys Gly Ile Leu Gly
51 35 40 45
52 Lys

56 <210> SEQ ID NO: 3
57 <211> LENGTH: 49
58 <212> TYPE: PRT
59 <213> ORGANISM: Hepatitis Delta Virus
61 <400> SEQUENCE: 3
62 Gly Arg Glu Glu Val Leu Glu Gln Trp Val Ser Gly Arg Lys Lys Leu
63 1 5 10 15
64 Glu Glu Leu Glu Arg Asp Leu Arg Lys Val Lys Lys Lys Ile Lys Lys
65 20 25 30
66 Leu Glu Asp Glu His Pro Trp Leu Gly Asn Ile Lys Gly Ile Leu Gly



RAW SEQUENCE LISTING DATE: 08/02/2000

PATENT APPLICATION: US/09/347,175 TIME: 12:57:01

Input Set : A:\07251056-001.txt

Output Set: N:\CRF3\08022000\I347175.raw

67 35 40 45
68 Lys

72 <210> SEQ ID NO: 4
73 <211> LENGTH: 49
74 <212> TYPE: PRT
75 <213> ORGANISM: Hepatitis Delta Virus
77 <400> SEQUENCE: 4
78 Gly Arg Glu Glu Val Leu Glu Gln Trp Val Ala Gly Arg Arg Lys Gln
79 1 5 10 15
80 Glu Glu Leu Glu Arg Asp Leu Arg Lys Thr Lys Lys Lys Ile Lys Lys
81 20 25 30
82 Leu Glu Glu Glu Asn Pro Trp Leu Gly Asn Ile Lys Gly Ile Leu Gly
83 35 40 45
84 Lys

88 <210> SEQ ID NO: 5
89 <211> LENGTH: 48
90 <212> TYPE: PRT
91 <213> ORGANISM: Hepatitis Delta Virus
93 <400> SEQUENCE: 5
94 Thr Arg Glu Glu Thr Leu Glu Lys Trp Ile Thr Ala Arg Lys Lys Ala
95 1 5 10 15
96 Glu Glu Leu Glu Lys Asp Leu Arg Lys Thr Arg Lys Thr Ile Lys Lys
97 20 25 30
98 Leu Glu Glu Glu Asn Pro Trp Leu Gly Asn Ile Val Gly Ile Ile Arg
99 35 40 45
102 <210> SEQ ID NO: 6
103 <211> LENGTH: 48
104 <212> TYPE: PRT
105 <213> ORGANISM: Hepatitis Delta Virus
107 <400> SEQUENCE: 6
108 Thr Arg Glu Glu Thr Leu Glu Lys Trp Ile Thr Ala Arg Lys Lys Ala
109 1 5 10 15
110 Glu Glu Leu Glu Lys Asp Leu Arg Lys Ala Arg Lys Thr Ile Lys Lys
111 20 25 30
112 Leu Glu Glu Glu Asn Pro Trp Leu Gly Asn Ile Leu Gly Ile Ile Arg
113 35 40 45

116 <210> SEQ ID NO: 7
117 <211> LENGTH: 49
118 <212> TYPE: PRT
119 <213> ORGANISM: Hepatitis Delta Virus
121 <400> SEQUENCE: 7
122 Gly Arg Glu Gln Ile Leu Glu Gln Trp Val Asp Gly Arg Lys Lys Leu
123 1 5 10 15
124 Glu Glu Leu Glu Arg Asp Leu Arg Lys Ile Lys Lys Lys Ile Lys Lys
125 20 25 30
126 Leu Glu Glu Glu Asn Pro Trp Leu Gly Asn Val Lys Gly Ile Leu Gly
127 35 40 45
128 Lys

132 <210> SEQ ID NO: 8 




133 <211> LENGTH: 49
134 <212> TYPE: PRT
135 <213> ORGANISM: Hepatitis Delta Virus
137 <400> SEQUENCE: 8
138 Gly Arg Glu Glu Ile Leu Glu Gln Trp Val Ala Gly Arg Lys Lys Leu
139 1 5 10 15
140 Glu Glu Leu Glu Arg Asp Leu Arg Lys Thr Lys Lys Lys Leu Lys Lys
141 20 25 30
142 Ile Glu Asp Glu Asn Pro Trp Leu Gly Asn Ile Lys Gly Ile Leu Gly
143 35 40 45
144 Lys

148 <210> SEQ ID NO: 9
149 <211> LENGTH: 37
150 <212> TYPE: PRT
151 <213> ORGANISM: Artificial Sequence
153 <220> FEATURE:
154 <223> OTHER INFORMATION: Residues 12-48 of delta 12-60 (Y)
156 <400> SEQUENCE: 9
157 Gly Arg Glu Asp Ile Leu Glu Gln Trp Val Ser Gly Arg Lys Lys Leu
158 1 5 10 15
159 Glu Glu Leu Glu Arg Asp Leu Arg Lys Leu Lys Lys Lys Ile Lys Lys
160 20 25 30
161 Leu Glu Glu Asp Asn
162 35

165 <210> SEQ ID NO: 10
166 <211> LENGTH: 604
167 <212> TYPE: DNA
168 <213> ORGANISM: Artificial Sequence
170 <220> FEATURE:
171 <223> OTHER INFORMATION: Synthetic Gene for Optimised Expression of HDAg-S
172 in E. Coli
174 <221> NAME/KEY: CDS
175 <222> LOCATION: (7)...(591)
177 <400> SEQUENCE: 10
178 gggcat atg agc cgt agc gaa cgt cgt aaa gat cgt ggc ggc cgt gaa 48
179 Met Ser Arg Ser Glu Arg Arg Lys Asp Arg Gly Gly Arg Glu



180 1 5 10
182 gat att ctg gaa cag tgg [...] agc ggc cgt aag aag tta gag gaa ttg 96
183 Asp Ile Leu Glu Gln Trp Val Ser Gly Arg Lys Lys Leu Glu Glu Leu
184 15 20 25 30
186 gaa cgt gat ctg cgt aaa ctg aaa aag aag att aag aaa ctg gaa gaa 144
187 Glu Arg Asp Leu Arg Lys Leu Lys Lys Lys Ile Lys Lys Leu Glu Glu
188 35 40 45
190 gat aac ccg tgg ttg ggt aat att aaa ggc att att ggc aag aaa gat 192
191 Asp Asn Pro Trp Leu Gly Asn Ile Lys Gly Ile Ile Gly Lys Lys Asp
192 50 55 60
194 aaa gat ggc gaa ggc gcg ccg ccg gcg aag aaa ctg cgt atg gat cag 240
195 Lys Asp Gly Glu Gly Ala Pro Pro Ala Lys Lys Leu Arg Met Asp Gln
196 65 70 75

198 atg gaa att gat gcg ggc ccg cgt aaa cgt ccg ctg cgt ggc ggc ttt 288
199 Met Glu Ile Asp Ala Gly Pro Arg Lys Arg Pro Leu Arg Gly Gly Phe
200 80 85 90
202 acc gat aag gaa cgt cag gac cat cgt cgt cgt aaa gcg ctg gaa aac 336
203 Thr Asp Lys Glu Arg Gln Asp His Arg Arg Arg Lys Ala Leu Glu Asn
204 95 100 105 110
206 aaa cgt aaa cag ctg agc agc ggc ggc aaa tct ctg agc cgt gaa gaa 384
207 Lys Arg Lys Gln Leu Ser Ser Gly Gly Lys Ser Leu Ser Arg Glu Glu
208 115 120 125
210 gaa gaa gaa ctg aaa cgt ctg acc gaa gaa gat gaa aaa cgt gaa cgt 432
211 Glu Glu Glu Leu Lys Arg Leu Thr Glu Glu Asp Glu Lys Arg Glu Arg
212 130 135 140
214 cgt att gca ggt cca tct gtt ggt ggt gtg aac ccg ctg gaa ggc ggc 480
215 Arg Ile Ala Gly Pro Ser Val Gly Gly Val Asn Pro Leu Glu Gly Gly
216 145 150 155
218 agc cgt ggt gca ccg ggc ggt ggc ttt gtg ccg tct atg caa ggt gtt 528
219 Ser Arg Gly Ala Pro Gly Gly Gly Phe Val Pro Ser Met Gln Gly Val
220 160 165 170
222 cca gaa agc ccg ttt gcg cgt acc ggc gaa ggc ctg gat att cgt ggc 576
223 Pro Glu Ser Pro Phe Ala Arg Thr Gly Glu Gly Leu Asp Ile Arg Gly
224 175 180 185 190
226 agc cag ggc ttt ccg taaaccatgg cgc 604
227 Ser Gln Gly Phe Pro
228 195

231 <210> SEQ ID NO: 11
232 <211> LENGTH: 195
233 <212> TYPE: PRT
234 <213> ORGANISM: Artificial Sequence
236 <220> FEATURE:
237 <223> OTHER INFORMATION: Amino acid sequence encoded by synthetic Gene for
238 Optimized Expression of HDAg-S in E. Coli
240 <400> SEQUENCE: 11
241 Met Ser Arg Ser Glu Arg Arg Lys Asp Arg Gly Gly Arg Glu Asp Ile
242 1 5 10 15
243 Leu Glu Gln Trp Val Ser Gly Arg Lys Lys Leu Glu Glu Leu Glu Arg
244 20 25 30
245 Asp Leu Arg Lys Leu Lys Lys Lys Ile Lys Lys Leu Glu Glu Asp Asn
246 35 40 45
247 Pro Trp Leu Gly Asn Ile Lys Gly Ile Ile Gly Lys Lys Asp Lys Asp
248 50 55 60
249 Gly Glu Gly Ala Pro Pro Ala Lys Lys Leu Arg Met Asp Gln Met Glu
250 65 70 75 80
251 Ile Asp Ala Gly Pro Arg Lys Arg Pro Leu Arg Gly Gly Phe Thr Asp
252 85 90 95
253 Lys Glu Arg Gln Asp His Arg Arg Arg Lys Ala Leu Glu Asn Lys Arg
254 100 105 110
255 Lys Gln Leu Ser Ser Gly Gly Lys Ser Leu Ser Arg Glu Glu Glu Glu
256 115 120 125
257 Glu Leu Lys Arg Leu Thr Glu Glu Asp Glu Lys Arg Glu Arg Arg Ile

258 130 135 140
259 Ala Gly Pro Ser Val Gly Gly Val Asn Pro Leu Glu Gly Gly Ser Arg
260 145 150 155 160
261 Gly Ala Pro Gly Gly Gly Phe Val Pro Ser Met Gln Gly Val Pro Glu
262 165 170 175
263 Ser Pro Phe Ala Arg Thr Gly Glu Gly Leu Asp Ile Arg Gly Ser Gln
264 180 185 190
265 Gly Phe Pro
266 195

269 <210> SEQ ID NO: 12
270 <211> LENGTH: 1679
271 <212> TYPE: DNA
272 <213> ORGANISM: Hepatitis Delta Virus
274 <400> SEQUENCE: 12
275 cttgagccaa gttccgagcg aggagacgcg gggggaggat cagctcccga gaggggatgc 60
276 cacggtaaag agcattggaa cgtcggagaa actactccca agaagcaaag agaggtctca 120
277 ggaagcggac gagatcccca caacgccgga gaatctctgg aaggggaaag aggaaggtgg 180
278 aagaaaaagg ggcgggcctc ccgatccgag gggcccaacc tccagatctg gagagcactc 240
279 cggcccgaag ggttgagtag cacccagagg gaggaatcca ctcggagatg agcagagaaa 300
280 tcacctccag aggacccctt cagcgaacaa gaggcgcttc gagcggtagg agtaagacca 360
281 tagcgatagg aggagatgct aggagtaggg ggagaccgaa gcgaggagga aagtaaagaa 420
282 agcaacgggg ctagccggtg ggtgttccgc cccccgagag gggacgagtg aggcttatcc 480
283 cggggaattc gacttatcgt ccccatctag cgggaccccg gacccccttc gaaagtgacc 540
284 ggagggggtg ctgggaacac cggggaccag tggagccatg ggatgcccct cccgatgctc 600
285 gactccgact ccccccccca agggtcgccc aggaatggcg ggaccccact ctgcagggtc 660
286 cgcgttccat cctttcttac ctgatggccg gcatggtccc agcctcctcg ctggcgccgg 720
287 ctgggcaaca ttccgagggg accgtcccct cggtaatggc gaatgggacc cacaaatctc 780
288 tctagattcc gatagagaat cgagagaaaa gtggctctcc cttagccatc cgagtggacg 840
289 tgcgtcctcc ttcggatgcc caggtcggac cgcgaggagg tggagatgcc atgccgaccc 900
290 gaagaggaaa gaaggacgcg agacgcaaac ctgtgagtgg aaacccgctt tattcactgg 960
291 ggtcgacaac tctggggaga aaagggcgga tcggctggga agagtatatc ccatggaaat 1020
292 ccctggtttc ccctgatgtc cagcccctcc ccggtccgag agaaggggga ctccgggact 1080
293 ccctgcagac tggggacgaa gccgcccccg ggcgctcccc tcgatccacc ttcgaggggg 1140
294 ttcacacccc caaccggcgg gccggctact cttctttccc ttctctcgtc ttcctcggtc 1200
295 aacctcctga gttcctcttc ttcctccttg ctgaggttct tgcctcccgc cgatagctgc 1260
296 ttcttcttgt tctcgagggc cttccttcgt cggtgatcct gcctctcctt gtcggtgaat 1320
297 cctcccctga gaggcctctt cctaggtccg gagtctacct ccatctggtc cgttcgggcc 1380
298 ctcttcgccg ggggagcccc ctctccatcc ttatccttct ttccgagaat tcctttgatg 1440
299 ttccccagcc agggattttc gtcctcaatc ttcttgagtt tcttctttgt cttccggagg 1500
300 tctctctcga gttcctctaa cttctttctt ccggccaccc actgctcgag gatctcttct 1560
301 ctccccccgc ggttcttcct cgactcggac cggctcatct cggctagagg cggcagtcct 1620
302 cagtactctt actcttttct gtaaagagga gactgctgga ctcgccgccc gagcccgag 1679

304 <210> SEQ ID NO: 13 

305 <211> LENGTH: 1683 

306 <212> TYPE: DNA 

307 <213> ORGANISM: Hepatitis Delta Virus 

309 <400> SEQUENCE: 13 

310 atgggccaag ttccgaacaa ggatccgcgg ggaggacgga tcacctcccg agaggggtaa 60 

311 gtcgctaaag agcattggaa cgtcggagat acaactccca agaaggaaaa aagagaaagc 120 



VERIFICATION SUMMARY DATE: 08/02/2000

PATENT APPLICATION: US/09/347,175 TIME: 12:57:05

Input Set : A:\07251056-001.txt

Output Set: N:\CRF3\08022000\I347175.raw

L:736 M:361 W: Invalid Split Codon, Sequence data for SEQ IDs: 23



